Predictive effects of structural variation on citation counts

نویسنده

  • Chaomei Chen
چکیده

A critical part of a scientific activity is to discern how a new idea is related to what we know and what may become possible. As the number of new scientific publications arrives at a rate that rapidly outpaces our capacity of reading, analyzing, and synthesizing scientific knowledge,we need to augment ourselveswith information that can effectively guide us through the rapidly growing intellectual space. In this article, we address a fundamental issue concerning what kinds of information may serve as early signs of potentially valuable ideas. In particular, we are interested in information that is routinely available and derivable upon the publication of a scientific paper without assuming the availability of additional information such as its usage and citations.We propose a theoretical and computational model that predicts the potential of a scientific publication in terms of the degree to which it alters the intellectual structure of the state of the art. The structural variation approach focuses on the novel boundary-spanning connections introduced by a new article to the intellectual space.We validate the role of boundary-spanning in predicting future citations using three metrics of structural variation—namely, modularity change rate, cluster linkage, and Centrality Divergence—along with more commonly studied predictors of citations such as the number of coauthors, the number of cited references, and the number of pages. Main effects of these factors are estimated for five cases using zero-inflated negative binomial regression models of citation counts. Key findings indicate that (a) structural variations measured by cluster linkage are a better predictor of citation counts than are the more commonly studied variables such as the number of references cited, (b) the number of coauthors and the number of references are both good predictors of global citation counts to a lesser extent, and (c) the Centrality Divergence metric is potentially valuable for detecting boundary-spanning activities at interdisciplinary levels.The structural variation approach offers a new way to monitor and discern the potential of newly published papers in context. The boundaryspanning mechanism offers a conceptually simplified

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Predictive validity of editorial decisions at an open access journal: A case study on Atmospheric Chemistry and Physics

In this study we investigate the quality of the selection process of an open access (OA) journal, taking as an example the journal Atmospheric Chemistry and Physics (ACP). ACP is working with a new system of public peer review. We examined the predictive validity of the ACP peer review system – namely, whether the process selects the best of the manuscripts submitted. We have data for 1111 manu...

متن کامل

Predicting Citation Counts Using Text and Graph Mining

As the volume of scientific literature grows faster it becomes more difficult for researchers to identify promising papers that are likely to become influential in their field. We study the problem of predicting future citation counts of papers given information available at the time of publication (five years forward in our pilot study). We apply machine learning techniques on a dataset of mil...

متن کامل

Use of Structure Codes (Counts) for Computing Topological Indices of Carbon Nanotubes: Sadhana (Sd) Index of Phenylenes and its Hexagonal Squeezes

Structural codes vis-a-vis structural counts, like polynomials of a molecular graph, are important in computing graph-theoretical descriptors which are commonly known as topological indices. These indices are most important for characterizing carbon nanotubes (CNTs). In this paper we have computed Sadhana index (Sd) for phenylenes and their hexagonal squeezes using structural codes (counts). Sa...

متن کامل

Citation Counts and Social Comparisons: Scientists’ Use and Evaluation of Citation index Data

Data from samples of biochemists and sociologists show that nearly all are familiar with citation indexes and that the two groups are equally likely to have used a citation index for bibliographic purposes. We develop three hypotheses from social comparison theory to account for variation in use and evaluation of citation counts as indicators of scientific achievement: (1) more highly cited sci...

متن کامل

Predicting citation counts of environmental modelling papers

We assessed all papers published in two key environmental modelling journals in 2008 to determine the degree to which the citation counts of the papers could be predicted without considering the paper’s quality. We applied both random forests and general additive models to predict citation counts using a range of easily quantified or categorised characteristics of the papers as covariates. The ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JASIST

دوره 63  شماره 

صفحات  -

تاریخ انتشار 2012